Sina Weibo Incident Monitor and Chinese Disaster Microblogging

Author

  • Hua Bai
Abstract

This paper describes initial work on developing an all-hazards emergency event detector using messages obtained in near-real-time from the public timeline of the Chinese Sina Weibo microblogging service. The system filters messages for target keywords corresponding to five types of emergency event (earthquakes, floods, typhoons, fires and storms) and then uses classifiers to identify messages from people experiencing the corresponding event. The study reports experiments that compare the performance of four different classification methods and explore whether the classifier can be improved using new training data recently captured by the Sina Weibo Incident Monitor (SWIM). Taking into account Chinese text pre-processing, feature selection and training set size, the experimental results show that the random forests classifier achieves the best performance but takes longer to run in R; how to improve this classifier for deployment in the SWIM system remains to be explored in future work. While similar work has been reported using Twitter content, this is the first time these techniques have been applied to the Sina Weibo microblogging service for multiple emergency event types. This paper also outlines the experience of accessing Sina Weibo messages, provides a summary of their structure and content, notes the challenges faced in processing this text using Natural Language Processing packages, and describes the website developed for users to view the processed messages. The long-term aim is to develop a general emergency notification and monitoring system for the various disaster event types in China reported by the public on Sina Weibo, which the appropriate emergency services can use as a source of improved situational awareness.
Subject Categories and Descriptors: H.2.8 [Database Applications]: Data mining; K.4.1 [Public Policy Issues]: Privacy

General Terms: Text Mining, Chinese Text Classification

Hua Bai 1*, Xunguo Lin 2, Bella Robinson 2, Robert Power 2

1 School of Management, Harbin Institute of Technology, Heilongjiang, 150001, China
2 CSIRO Digital Productivity Flagship, G.P.O. Box 664, Canberra, ACT 2601, Australia

baihua1727@163.com
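
The two-stage design described in the abstract (a keyword filter followed by a classifier that keeps only first-hand reports) can be illustrated with a minimal Python sketch. This is not the authors' implementation, which was run in R. The keyword lists, the function names `match_events` and `detect`, and the `toy_classifier` stand-in are all assumptions made for illustration; a real system would replace the stand-in with a trained model such as the random forests classifier mentioned above.

```python
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical keyword lists: the paper's actual target keywords are not given here.
EVENT_KEYWORDS: Dict[str, List[str]] = {
    "earthquake": ["地震"],
    "flood": ["洪水"],
    "typhoon": ["台风"],
    "fire": ["火灾"],
    "storm": ["暴风雨"],
}


def match_events(message: str) -> List[str]:
    """Stage 1: return the event types whose keywords occur in the message."""
    return [
        event
        for event, words in EVENT_KEYWORDS.items()
        if any(word in message for word in words)
    ]


def detect(
    messages: Iterable[str],
    is_firsthand: Callable[[str, str], bool],
) -> List[Tuple[str, str]]:
    """Stage 2: keep keyword matches that the classifier labels as first-hand reports."""
    hits: List[Tuple[str, str]] = []
    for message in messages:
        for event in match_events(message):
            if is_firsthand(event, message):
                hits.append((event, message))
    return hits


def toy_classifier(event: str, message: str) -> bool:
    """Placeholder for a trained model (assumption): looks for a first-person cue."""
    return "我" in message
```

Passing the classifier in as a callable keeps the keyword filter independent of the model, so different classification methods can be swapped in and compared without touching the filtering stage.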

Similar resources

Developing a Sina Weibo Incident Monitor for Disasters

This paper presents ongoing work to develop an earthquake detector based on near-real-time microblog messages from China. The system filters earthquake-related keywords from Sina Weibo messages available on the public timeline and uses a classifier to determine whether the messages correspond to people experiencing an earthquake. We describe how the classifier has been established and report prelimi...

Exploring Health Topics in Chinese Social Media: An Analysis of Sina Weibo

This paper seeks to identify and characterize health-related topics discussed on the Chinese microblogging website, Sina Weibo. We identified nearly 1 million messages containing health-related keywords, filtered from a dataset of 93 million messages spanning five years. We applied probabilistic topic models to this dataset and identified the prominent health topics. We show that a variety of he...

Topical differences between Chinese language Twitter and Sina Weibo

Sina Weibo, China’s most popular microblogging platform, is currently used by over 500M users and is considered to be a proxy of Chinese social life. In this study, we contrast the discussions occurring on Sina Weibo and on Chinese-language Twitter in order to observe two different strands of Chinese culture: people within China who use Sina Weibo with its government-imposed restrictions and th...

To See and to Be Seen: Chinese White-Collar Workers' Interpretation of Microblogging and Social Capital

In recent years, microblogging has gained enormous popularity in China, especially among urban professional workers. This phenomenological study investigates how white-collar workers in China experience microblogging and how they perceive the impact of microblogging on their lives. Twenty in-depth face-to-face interviews were conducted in Beijing and Qingdao with young white-collar professional...

Artificial Inflation: The True Story of Trends in Sina Weibo

There has been a tremendous rise in the use of online social networks all over the world in recent years. This has enabled users to generate a large amount of real-time content at an incessant rate, all competing with each other to attract enough attention and become trends. While Western online social networks such as Twitter have been well studied, characteristics of the popular Chines...

Publication date: 2015